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Abstract. The QQ plot is a commonly used technique for informally deciding whether a univariate random 
sample of size n comes from a specified distribution F. The QQ plot graphs the sample quantiles against 
the theoretical quantiles of -F and then a visual check is made to see whether or not the points are close to 
a straight line. For a location and scale family of distributions, the intercept and slope of the straight line 
provide estimates for the shift and scale parameters of the distribution respectively. Here we consider the 
set Sn of points forming the QQ plot as a random closed set in R^. We show that under certain regularity 
conditions on the distribution F, Sn converges in probability to a closed, non-random set. In the heavy 
tailed case where 1 — _F is a regularly varying function, a similar result can be shown but a modification is 
necessary to provide a statistically sensible result since typically F is not completely known. 



1. Introduction 

Given a random sample of univariate data points, a pertinent question is whether this sample comes from 
some specified distribution F. A variant question is whether the sample is from a location/scale family 
derived from F. Decision techniques are based on how close the empirical distribution of the sample and the 
distribution F are for some sample size n. The empirical distribution function of the iid random variables 
Xi,. . . ,Xn is 



1 

Fn(x) :— —y I(Xi < x), — oo < X < oo. 



n 



The Kolmogorov-Smirnov (KS) statistic is one way to measure the distance between the empi rical dist r ibutio n 
function and the distribution function F. Glivenko and Cantelli showed (see, for example, Serfline ( 1980l )) 



that the KS-statistic converges to almost surely. The QQ (or quantile-quantile) plot is another commonly 
used device to graphically, quickly and informally test the goodness-of-fit of a sample in an exploratory way. 
It has the advantage of being a graphical tool, which is visually appealing and easy to understand. The 
QQ plot measures how close the sample quantiles are to the theoretical quantiles. For < p < 1, the p*'' 
quantile of F is defined by 

(1.1) F^{p) -.^Mix : F{x)>p}. 

The sample p*'' quantile can be similarly defined as F^{p). If Xi-n < ^2:ri < ■ ■ • < Xn-.n are the order 
statistics from the sample, then F^{p) = X^np\:n- where as usual [np] is the smal lest integer gr eater than 
or equal to np. For < p < 1, X^np'\:n is a strongly consistent estimator of F^{p) ( Serflind . 1980l page 75). 



Rather than considering individual quantiles, the QQ plot considers the sample as a whole and plots the 
sample quantiles against the theoretical quantiles of the specified target distribution F. If we have a correct 
target distribution, the QQ plot hugs a straight line through the the origin at an angle of 45°. Sometimes 
we have a location and scale family correctly specified up to unspecified location and scale and in such cases, 
the QQ plot concentrates around a straight line with some slope (not necessarily 45°) and intercept (not 
necessarily 0); the slope and intercept estimate the scale and location. Since a variety of estimation and 
inferential procedures in the practice of statistics depends on the assumption of normality of the data, the 
normal QQ plot is one of the most commonly used. 
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Our goal here is to formally prove the convergence of the QQ plot (perhaps suitably modified) to a straight 
line. This would show the asymptotic consistency of the QQ plot. The QQ plot formed by a sample of size 
n can be considered a closed subset of denoted by 5„. This set of points that form the QQ plot in is 

(1.2) Sn := {iF^i^—),X.,,„), l<i<n} 

n + I 

where the function F^(-) is defined by (jl.ip . For each n, 5„ is a random closed set. Note that, if {5„} has 
an almost sure limit S then this limit set by the Hewitt-Savage — 1 law must be almost surely constant. 
A straight line (or some closed subset of a straight line) is also a closed set in . Under certain regularity 
conditions on F, we show that the random set 5„ converges in probability to a straight line (or some closed 
subset of a straight line), in a suitable topology on closed subsets of M^. 

Section [2] is devoted to preliminary results on the convergence of random closed sets. We also discuss 
a result on convergence of quantiles and, because of our interest in heavy tails, we introduce the concept 
of regular variation. In Section [31 we assume the random variables have a specified distribution F and we 
consider convergence of the random closed sets 5„ forming the QQ plot. In Section [H the idea of the QQ 
plot is extended to the case where we know that the data is heavy tailed, that is 1 — F is regularly varying. 
We assume we do not know the exact distribution of F; we presume the distribution is heavy tailed but do 
not know either the tail index or the slowly varying component. The usual QQ plot is not informative in a 
statistical sense and hence must be modified by a thresholding technique. 

In Corollarv l3.4l we have convergence of a log-transformed version of the QQ plot to a straight line when 
the distribution of the random sample is Pareto. Now Pareto being a special case of a distribution with 
regularly varying tail, we use the same plotting technique for random variables having a regularly varying tail 
after thresholding the data. We provide a convergence in probability result considering the k — k{n) upper 
order statistics of the data set where k oo and k/n 0. In Se ction [51 a continuity r e sult is provided for 



a leas t squares line through these special kinds of closed sets. See lBeirlant et al. ( 1996( ). Kratz and Resnick 



(|1996D 



2. Preliminaries 

2.1. Closed sets and the Fell topology. We denote the distance between the points x and y by d{x, y); 
T ^ Q and K, are the classes of closed, open and compact subsets of M'* respectively. These quantities are 
sometimes subscripted by the dimension of the space if this needs to be emphasized for clarity. We are 
interested in closed sets because the sets of interest such as 5„ are random closed sets. There are several 
ways to define a topology on the space of closed sets. The Vietoris topology and the Fell topology are 
frequently used and these are h i t-or-m i ss kinds of topologies. We sha l l disc uss the Fell topology below. For 
further discussion refer to Beer (1993), Matheron ( 19751 ). Molchanov ( 2005 ). 



For a set i? C M"^, define Tq as the class of closed sets hitting B and as the class of closed sets disjoint 
from B: 

Tb^{F -.F CT, f[\B^^, = {F:FeT, i^p|B==0}. 

Now the space can be topologized by the Fell topology which has as its subbase the families {T^ , K e K.} 
and {Tg, G e G}. 

A sequence {J^„} converges in the Fell topology towards a limit F in !F (written F„ F) if and only if 
it satisfies two conditions: 

(1) If an open set G hits F, G hits all F„, provided n is sufficiently large. 

(2) If a compact set K is disjoint from F, it is disjoint from F„ for all sufficiently large n. 

The following result I Matheronl , |l975[ ) provides useful conditions for convergence. 



Lemma 2.1. For F^, F ^ T,n > 1, F^ ^ F as n ^ oo if and only if the following two conditions hold 

(2.1) For any y E F, for all large n, there exists y„ G Fn such that d{y.^, y) — > as n oo. 

(2.2) For any subsequence {rik}, ^fUuk ^ -^"t converges, then lim G F. 
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Furthermore, convergence of sets Sn S in IC is equivalent to the analogues of and holding as 

well as supj->]^ sup{||a;|| : x g Sj} < oo for some norm \\ ■ \\ on R"^. 

Note that if the sets are random elements of /C and 5 € /C is non-random, then Lemma |2. II can be used 
to characterize almost sure convergence or convergence in probability. We are going to define random sets 
in the next subsection. 

Definition 2.1 (Hau sdorff Metr i c). S uppose d : R'' x R'^ ^ M+ is a metric on R'^. Then for S,T E K., define 
the Hausdorff metric (jMatheronl . 1 19751 ) D : JC x K. ^ R+ hy 

(2.3) D{S, T) = inf{<5 :ScT\Tc S^}, 
where for S E K, and 6 > 0, 

(2.4) ^ {x : d{x, y) < 6 for some y E S} 
is the (5-neighborhood or (5-swelling of S. 

The topology usually used on /C is the myopic topology with sub-base elements {/C^, F E J-} and {/Cg, G E 
Q}. The myopic topology on /C is stronger than the Fell topology relativized to JC. The topology on /C' = 
JC \ {0} generated by the Hausdorff metric is equivalent to the myopic topology on IC' (iMolchanovl . l2005l 
page 405). 

In certain cases, convergence on can be reduced to convergence on IC. 

Lemma 2.2. Suppose F„ and F are closed sets in T and and that there exist /Ci C /C satisfying 

(1) \J K^E. 

KeKi 

(2) For S > and K E IC, we have E Ki. 

(3) F„{^K F{^K, WKeJCi. 
Then Fn F in T . 

Remark 2.1. The converse is false. Let E = M, F„ = {1/n}, F = {0} and K = [-1,0]. Then Fn ^ F but 

Fnf]K = 1!}^ Ff]K = F. 

The operation of inters ection is not a co ntinuous operation in T x i Molchanov , 20051 page 400); it is only 
upper semicontinuous (iMatheron . Il975l page 9). 



Proof. We use Lemma [2.11 If a; e F, there exists K E JCi and x E K. So x E F n K and from Lemma [2.1) 
since Fn H K ^ F n K as n ^ oo, we have existence of Xn E Fn H K and Xn x. So we have produced 
Xn E F and a;„ ^ a; as required for (|2.ip . 

To verify (|2.2p . suppose {xn^} is a subsequence such that Xn^, E and {a;^^} converges to, say, Xoo- We 
need to show Xoo & F. There exists Kac E JCi such that a;oo G ^oo- For any 5 > 0, E E JCi for all 
sufficiently large rife. So x^,^ E F^^ nif^. Since ^K^^ FnK^,, we have liiuk EFHKi,. 
SoxE F. □ 

The next result shows when a point set approximating a curve actually converges to the curve. For this 
Lemma, C(0, 1] is the class of real valued continuous functions on (0,1] and Z?;(0,oo] is the class of left 
continuous functions on (0,oo] with finite right hand limits. 

Lemma 2.3. Suppose < x(-) E C(0, 1] is continuous on (0, 1] and strictly decreasing with lim^j^o 2;(e) = oo. 
Suppose further that ?/«(■) G Di{0,l] and y{-) E C(0, 1] and y-n ^ y locally uniformly on (0,1]; that is, 
uniformly on compact subintervals bounded away from 0. Then for k = k{n) oo, 

F„ {(x(^),2;„(^)); 1 < j < fc} ^ F := {{x{t),y{t))-Q < i < 1} = {{u,y{x^ {u)))-x{l) <u<^}, 
in T . 
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Proof. Pick t e (0, 1], so that (x{t),y{t)) e F. Then 

Fn 3 {x{\kt^/k),yn{\kt^/k)) ^ {x{t),y{t)) G F, 

in M^, verifying (|2.ip . For (|2.2p . Suppose {x{j{n')/k{n'),yni{j{n')/k{n')) e F„' is a convergent subsequence 
in R^. Then /fc(n')} is convergent in R and because x{-) is strictly monotone, {i(n') /fc(n')} converges 

to some / € (0, 1]. Then 

^^„' 3 (a;(jK)/fcK),yn'(jK)/fcK)) ^ (a:(0,2/(0) € F, 
which verifies (|2.2p . □ 



2.2. Random closed sets and weak convergence. In this section, we review definitions and character- 
izations of weak convergence of random closed sets. In subsequent sections we will show convergence in 
probability, but s ince the limit set s will be non -random, weak convergence and convergence in probability 
coincide. See also iMatheronI (|l975l ). iMolchanovl (|2005V 

Let (ri, y^, P ) be a complete probability space. T is the space of all closed sets in R'' topologized by the 
Fell topology. Let aj^ denote the Borel a-algebra generated by the Fell topology of open sets. A random 
closed set X : ^ ^ T \s a. measurable mapping from P ) to (JF, cr^r). Denote by P the induced 

probability on crjr, that is, P = P o X^^ . A sequence of random closed sets {X„}„>i weakly converges 
to a random closed set X with distribution P) if the corresponding induced probability measures {Pn\n>i 
converge weakly to P, i.e.. 



P„(6) = p: o X-\B) ^ P{B) = P' o X-\B), 



for each B G crjr such that P{dB) — 0. 

This is not always straightforward to verify from t he definition. W e find useful the following characteri- 
zation of weak convergence in terms of sup- measures (jVervaati . Il997f ). Suppose h : R."^ i-^ R+ ~ [0, oo). For 
X C R*^, define h{X) — {h{x) : x G X} and ft,^ is the sup-measure generated by h defined by 

h^'iX) = sup{/i(x) :x&X} 



( Molchanov . 2005 . Vervaatl . 1997 ). These definitions permit the following characterization ( Molchanov . 20051 
page 87). 

Lemma 2.4. A sequence {Xn}n>i of random closed sets converges weakly to a random closed set X if and 
only i/E/i^(X„) converges to Kh"^ (X) for every non-negative continuous function h : Mf^ i— > R with a bounded 
support. 

2.3. Convergence of samp l e qu antiles. The sample quantile is a strongly consistent estimator of the 



population quantile (jSerflingj (|1980), page 75). Th e weak c onsistency of sa mple quantiles as estimators of 



bmirnov (|l949l ): see also (|Resnickl [l999i page 179). We will make use of 



population quantiles was shown by 
the Glivenko-Cantelli lemma describing uniform convergence of the sample empirical distribution and also 
take note of the following quantile estimation result. 

Lemma 2.5. Suppose F is strictly increasing at F^{p) which means that for all e > 0, 

F{F^{p-e)) <p<F{F^{p + e)). 
Then we have that the p*^^ sample quantile, X^np'\:n is a weakly consistent quantile estimator, 



X 



\np\: 



F-{p) 



As before, \np'\ is the 1** integer > up and Xi-n is the i*"^ smallest order statistic. 
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2.4. Regular variation. Regular variation is the mathe m atical underpin ni ng of heavy tail analysis. It 
is discussed in many books such as .Bingham et al.1 (Il987l). de Haan ( 197Cll ). de Haan and Ferreira (t2006. ). 
Geluk and de HaanI (ll987l ). lResnic5 (|l987l . l2006l ). ISenetal (|l976h . 



Definition 2.2 (Regular variation). A measurable function [/(•): K+ R+ is regularly varying at oo with 
index p e M if for a; > 

U{tx) _ 



(2.5) lim 



t^oo U{t) 



We write U G RVp. 



Remark 2.2. When p = we call U{-) slowly varying and denote it by L{-). For p G R, we can always 
write U E RVp as: 

(2.6) U{x) = xPL{x) 

where L{-) is slowly varying. 

3. QQ PLOTS FROM A KNOWN DISTRIBUTION: RANDOM SETS CONVERGING TO A CONSTANT SET 

In this section, we will use the results in Section [2] to show the convergence of the random closed sets 
given by (|1.2p consisting of the points forming the QQ plot to a non-random set in . First we consider the 
easiest case where the random variables are iid from a uniform distribution. Then we consider more general 
distributions which are continuous and strictly increasing on their support. This result will be derived from 
the uniform case. Because we are interested in heavy tailed distributions, our final corollary in this section 
is about the Pareto distribution which is the exemplar of the heavy tailed distribution. 

3.1. The Uniform case. The first simple example is QQ plot from the uniform distribution. 
Proposition 3.1. Suppose [/i, C/2, ■ ■ • , C/n iire iid U(0,1). Denote the order statistics of this sample by 

Ul:n < U2:n < ■ ■ ■ < Un:n- Define 

(3.1) Sn := {(^-T- U^■.n), 1 < « < 

n + 1 

and 

(3.2) 5 := {(x,x) : < a; < 1}. 
1 hen On — > o m a^2 • 

Proof. We apply the convergence criterion given in Lemma 12.11 The empirical distribution Un{x) = 
Sr=i ^(^« — ^) converges uniformly for almost all sample paths to a;, < a; < 1. Without loss of 
generality suppose this true for all sample paths. Then for all sample paths, the same is true for the inverse 
process U^{p) = C/[„p]:„, < p < 1; that is 

sup |J7p„p-|:„ -p| ^ 0, (n^oo). 

0<f)<l 

Pick < y < 1 and let y = (y, y) g S. For each n, define y„ by 

(3.3) y« = 

so that y„ e 5„. Since \ny — \ny'\ | < 1 , [ny]/n + 1 y and since C/prij,]:„ y, we have y„ — > (y, y) e S. 
Hence criterion (|2.ip from Lemma [2J] is satisfied. 

Now suppose we have a subsequence {n^} such that y„ G converges. Then y„ is of the form 
Vrik ~ (*nfc/("-fc + 1)7 Ui^^;nk) for somc 1 < < n and for some x G [0, 1], we have iuk/iji'k + 1) ^ a; and 
hence also in^/uk — > x. This implies 

"■k 

and therefore y„ {x,x) as required for (|2.2p . □ 
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3.2. Convergence for more general distributions. Now consider a distribution function F wlriclr is 
more general than the uniform, assuming that F is strictly increasing and continuous on its support so that 
F^ is unique. 

Proposition 3.2. Suppose Xi, . . . , Xn are iid with common distribution F{-) and Xi-n < X2:n < ■ • • < Xn-.n 
are the order statistics from this sample. If F is strictly increasing and continuous on its support, then 

T;, := ),X„0; 1 < z < n} 

n + 1 

converges in probability to 

T :— {(x,x);x G support(F)} 

in J^2- 

Proof. According to Lemma 12.41 we must prove for any non- negative continuous /i : i-^ with compact 
support that as n — > oo, 

E(/.^(7;,)) ^E(/j^(r)). 

Since F is continuous, F{Xi), F{X2), . . . , F{Xn) are iid and uniformly distributed on [0,1]. Therefore 
from Proposition 13.11 we have that 

(3.4) Sn {(^— , F(X,:„)); 1 < i < n} ^ {^— , C/,:„); I < i < n} S ^ {{x, x);0<x<l} 

n+ 1 n+ 1 

in /C2. 

We now proceed by considering cases which depend on the nature of the support of F. We will need the 
following identity. For any closed set X, function / : 1-^ R+ and function ■0 : K^, we have, 

(3.5) fot^iX)^ sup /(i) = sup/(^(s)) = sup/o^(.s) = (/o^)^(X). 

tGi/j(X) sex sex 

Case 1: The support of F is compact, say [a,b]. This implies F^{0) = a, F^{1) = b. Define the map 
g : [0,1]2 ^ [a,6]2 by 

g{x,y) = {F^{x),F^{y)). 

Since F is strictly increasing, observe that g{Sn)=Tn and g{S) = T. Define : R^ i-^ R^ as the extension 
of ,g to all of R2: 

9*{x,y) = {gi{x),gi{y)) 

where 

{F^{z), 0<z<l 
a, z < 

b, z>l. 

This makes g* ■.M.'^ 1^ R^ continuous. Since both 5„ and S are subsets of [0, 1] x [0, 1], we have g{Sn) = g*{Sn) 
and g{S) = g*{S). Let / be a continuous function on R^ with bounded support and we have, as n —> 00, 
using (|3.5p . 

EnX,) = E/^(g(5„)) = E/^(g*(5„)) 

= Eifog*y{S^)^E{fog*y{S). 

The previous convergence results from fog* : R^ 1— *■ R+ being continuous with bounded support, 5„ 5, 
and Lemma 12.41 The term to the right of the convergence arrow above equals 

= E/^(5*(5)) = Er (g(5)) = E/^(r). 

Therefore 7^ converges to T weakly and since T is a non-random set, this convergence is also true in 
probability. 
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Case 2: The support of F is R — (—00, 00). Now define g : (0, 1)^ 1^ by 

g{x,y) = iF^ix),F-iy)). 

Since F is strictly increasing, g{Sn) — %i and g(5n(0, 1)^) = T. Let / be a continuous function with bounded 
support in [— M, M]^, for some M > 0. Extend the definition of g to all of by defining : R^ 1-^ R-^ as 

where 

{F^(z), -Af < i^^(z) < M, 
F'-(-M), F-(z)<-Af, 
F^{M), F^{z)>M. 

Therefore g* : R^ 1-^ R^ is continuous. Now note that since support{f) C [—M,M]^ and g{x,y) = g*{x,y) 
for {x, y) G [— M, M]^, we will have f o g = fog*. Therefore 

E/^(7;,) = = E(/ o g^iSn) = E(/ o g*)^(5„) ^ E(/ o g*y{S). 

As with Case 1, the convergence follows from / o <;* : !—> R_|_ being continuous with bounded support, 
Sn ^ S and Choquet's theorem 12.41 The term to the right of the convergence arrow equals 

- E(/ o g^iS) = Er (g(5)) = Er (T). 

Therefore 7n converges to T weakly. But since T is a non-random set, this convergence is true also in 
probability. 

Case 3: The support of F is of the form [a^oo) or (—00,6]. This case can be examined in a similar manner 
as we have done for Cases 1 and 2 by considering each end-point of the interval of support of F according 
to its nature. □ 

Corollary 3.3. If F is exponential with parameter a > 0, i.e., F{x) = 1 — e~"^, x > Q, we have 

{(--log(l '—),X„,);l< i<n)^{{x,x) :0<x<(X3}. 

a 71 + 1 

Corollary 3.4. If F is Pareto with parameter a > 0, i.e., F{x) = 1 — x^" , x > I, we have 

{(- log(l - ^-), logX,:„); 1 < z < 71} 4 {{x, -) : < a: < 00}. 
71 + 1 a 

4. QQ PLOTS: Convergence of random sets in the regularly varying case 

The classical QQ plot can be graphed only if we know the target distribution F at least up to location 
and scale. We would like to extend the idea of QQ plots to the case where the data is from a heavy 
tailed distribution; this is a semi-parametric assumption which is more general than assuming the target 
distribution F is known up to location and scale. 

We model a one-dimensional heavy-tailed distribution function F by assuming it has a regularly varying 
tail with some index —a, for a > 0; that is, if X has distribution F then, 

(4.1) P[X > x] = l~ F{x) ^ F{x) x-'^Lix), x>0 

where L is slowly varying. In at least an explorat o ry con text, how can the QQ plot be used to validate this 
assumption and also to estimate a? (See iResnic 3 (l200fil . page 106).) 

Notice that if we take L = 1, F turns out to be a Pareto distribution with parameter a. In Corollary 
p.4p . we have seen that if F has a Pareto distribution with parameter a, then S„ defined as: 

(4.2) 5„:={(-log(l L^),logX,:„); 1 < i < n} 
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converges in probability to the set 

(4.3) S = {{x,-);0<x<oo}. 

a 

Keeping this in mind, when we have a general F E RV-a, let us define 5„ exactly as in (|4.2|) . Then we 
are able to show that, 5„ converges in probability to the set 

(4.4) S = {(ax, x + - \ogL(F^(l - e""^))); < a; < oo}. 

a 

But, since we do not know the slowly varying function L{-), this result is not useful for inference pur- 
poses. Estimating a from such a set is not possible unless L(-) is known, nor is it clear how 5„ graphically 
approximating such a set would allow us to validate the model assumption of a regularly varying tail. 

Consequently we concentrate on a different asymptotic regime where the asymptotic behavior of the 
random closed set can be freed from L{-). For a sample of size n from the distribution F, where F € RV-a, 
we consider the upper k = k(n) order statistics of the sample where k(n)/n and construct a QQ plot 
simil ar to (14.21). We assume th at dy^{-,-) is some metric on ^ which is compatible with the Fell topology. 
Note IFlachsmeveij (|l963/1964l) characterized the metrizability of the Fell topology and since R'^ is locally 
compact, Hausdorff and second countable his results apply and allow the conclusion that ^ is metrizable 
under the Fell topology. 

For what follows, when A e J-2, we write A + (ti, — {a + {ti, a E A} for the translation of A. 

Proposition 4.1. Suppose we have a random sample Xi,X2, ■ ■ ■ ,Xn from F where F e RV-a o,nd ^(1) > 
^(2) > ■ • ■ > -^(n) o-i^e the order statistics in decreasing order. Define 

Sn = {(-log--^,logX(j));j ^l,...,k} 

where k = k{n) 00 and k/n as n 00. Also define 

%, = {(x, -)]x> 0} + (-log— ^,logX(fc)) 
a n + 1 ' 



Then as n 00 



djr{Sn,Tn) ^ 



Remark 4.1. So after a logarithmic transformation of the data, we make the QQ plot by only comparing 
the k largest order statistics with the corresponding theoretical exponential distribution quantiles. This 
produces an asymptotically linear plot of slope 1/a starting from the point (— log logX(fe)). 

Proof. Define 

X 

5l = {(-log^,log-^);l <j <fc}, andr = {(x,-);0<a^<oo}. 

Note that we can write 

5: = {(-logf,logf(^);l<j<fc}^{(-logt,log^);te{i,...,^,l}}, 
k A(fc) A(fc) k k 

and also write T as 

r = {{x, -);x > 0} = {(- logt, -- logt); < t < 1}, 
a a 

where we put x — — logt. We first show 5^ T. 



Referring to Lemma 12731 set 

X 1 

x{t)^-\ogt, y„(t) ^log-[Ml, y{t) = logi, 0<i<l. 

J From iResnick ( 2006 . page 82, equation (4.18)), we have y„ y, in 15/(0, 1], the left continuous functions 
on (0, 1] with finite right limits, metrized by the Skorohod metric. Suppose {n"} is a subsequence. There 
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exists a further subsequence {n'} C {n"} such that F„/ y, in Di{0, 1], and by Lemma [2.31 5^/ T in 

JP. Therefore ^ T, in J^, as n —^ oo. 

Now observe that with a„ := (— log logX(fe)), we have 



5n ={(-log--^,logX(j));j = l,...,k} 



={(- log ^, log ^); J = 1, . . . , fc} + (- log logX(fc)) 



Also, 



=iS„ + a.„. 



t;, {[x,-)-x> 0} + (-log— ^,iogX(fe)) = r + a„. 

a n + 1 



Now, since djr[S,^,T) — > 0, we get 

djr{S,i, %i) = djr(5„ + a„, T + a„) = djr{S^, T) 0, 
as required. □ 

5. Least squares line through a closed set 

5.1. Convergence of the least squares line. The previous two sections gave results about the convergence 
of the QQ plot to a straight line in the Fell topology of ^2- It is of interest to know whether some functional 
of closed sets is continuous or not and, in particular, the slope of the least squares line through the points of 
QQ plot is one such functional. The slope of the least squares line is an estimator of sca l e for location/scale 
families an d this leads to a n estimate of the heavy tail index a; see Beirlant et al. ( 1996[ l. [Kratz and Resnic^ 
(|l996f ) and lResnicd (|2006l . Section 4.6). 



Intuition suggests that when a sequence of finite sets converges to a line, the slope of the least squares 
line should converge to the slope of the limiting line. However there are subtleties which prevent this from 
being true in general. We need some restriction on the point sets that converge, since otherwise, a sequence 
of point sets which are essentially linear except for a vanishing bump, may converge to a line but the bump 
may skew the least squares line sufficiently to prevent the slope from converging; see Example 15.11 below. 

The following Proposition provides a condition for the continuity property to hold. First define the 
subclass Tfiaitc or line C T2 to bc the closed sets of T2 which are either sets of finite cardinality or closed, 
bounded line segments. These are the only cases of compact sets where it is clear how to define a least 
squares line. For F £ J-finite or linej the functional LS is defined in the obvious way: 

LS{F) — slope of the least squares line through the closed set F 

For the next proposition, we consider sets Fn '■= {{xi(n),yi{n)) : 1 < ? < A:„} of points and write 
Xn = and ?;„ = yj(^)/kn- Also, for a finite set iS„, #5„ denotes the cardinality of 5„. 

Proposition 5.1. Suppose we have a sequence of sets Fn := {{xi{n)^yi{n)) : 1 < i < A:„} G /C2, each 
consisting of kn points, which converge to a hounded line segment F G K,2 with slope m where |m| < 00, as 
kn 00. Then 

LS{Fn) LS{F) ^ m 

provided the following condition holds: 

#( {(i„ -5,xn + 5)x (y„ -5,yn + 5)} fl 
(5.1) 3 (5 > 0, such that p'/ ^ — ^ ^ Pi G [0, 1). 

This Proposition gives a condition for the continuity of the slope functional LS{-) when {Fnn > 1} and 
F are bounded sets in J^nnite or line- The next example shows the necessity of condition (j5.1[) . which prevents 
a set of outlier points from skewing the slope of the least squares line. 
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Example 5.1. For n > 1, define the sets: 

i^„ = {(^,0),-n<*<n;(i(l + ^) i(l + |^)),0< j<2"} and F = [-1, 1] x {0}. 

We develop features about this example. 

(1) For the cardinality of F„ we have 

= fc„ = 2" + 2n + 2. 

(2) We have Fn ^ F K,^- As before, denote the Hausdorff distance between two closed sets in /C2 by 

•) and we have D{Fn, F) < 3/n ^ as n ^ 00. 

(3) Condition (|5.ip is not satisfied. To see this pick any n > 1 and observe 

_ _ _ _ 3(2" + 1) _ 3(2" + 1) _3_ 
^" ~ ^" ~ 2n(2" + 2n + 2) ~ 2nfc„ 2^' 

Fix S > 0. For all n so large that 5 > l/(2n) we have 

#({{Xn-S,Xn+S) X {y.n - 5,yn+6)}C\Fn] 2" + 1 

Wn -2" + 2n + 2^1' 

Obviously for this example, m — LS{F) = 0. However, if denotes the slope of the least squares line 
through Fn then we show that m„ ^ 1 ^ = m. To see this, observe that conventional wisdom yields, 

E {yi{n) - y){xi{n) ~ x) 

(5.2) m„ 



J2 {x,{n)-x) 



For the numerator we have, 



(j/i(n) - 2/„)(xi(n) - a:„) = ^ yi{n)xi{n) - knVnXn 

{xi{n},y,{n))eFr, (i, (n) (n)) Gi=^„ 

i=0 \j=0 



and using the identity J2f=i P = ^{^ + 1)(A^ + 5)/3 = N{N + 1)(2A^ + l)/6, we get the above equal to 



2> 



' ^.(2" + l) + ^^"(^" + ^)(^" + ^)-A(2-^ + l)^^ 
^ 22" 3 4fc„ ^ , 
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2" + !/. 2" + i 9_^^„_^ ^ 

I. ^ 



3-2" Akn ) I2n^' 

For the denominator, we use the calculation already done for the numerator: 

{Xi{n) - Xn)"^ ^ E Xi{nY-kn{xnT 

{x^{n),yi(n))<£F„ {xi{n),y,{n))£F,, 

\-(^^^^\- 1 , / 3(2" + i) 

i—~n j—0 



y^(-)^+ yi{n)xi{n) ~ knynXn 

z — / ,^ z — / 

i=-'n {xi(n),yi(n))£F„ 
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2n{n + l)(2n +1) fc„ ^ ^" \ 



12n2 U2n2' 12n2 ' 
Combining the asymptotic forms for numerator and denominator with (j5.2p yields 

so TO„ ^ 1 7^ TO, as claimed. □ 
Proof of Provosition \5. 11 For (^^(tt,), yi{n)) € Fn, we can write 

(5.3) yi{n) = mxi{n) + Zi{n) I <i <K 

We want to show that to„ = LS{Fn) m = LS{F), as n ^ cxd. Fix e > 0. We will provide N such that for 
n > N, we have |to„ — to| < e. 

First of all, condition (|5.ip allows us to fix (5 > such that 

Ps ■=Pn^ pr >P<^- 

Choose A^i such that for n > A^i, we have Pn < or equivalently that 1 — p„ > For 77 > and 

F E IC2, recall the definition of the 77-swelling of F: 

(5.4) F^ = {x : d{x, y) < -q for some y € F}. 

Since D{Fn, F) ^ in IC2, we can choose N2 such that for all n > N2 we have Fn C F'^'^ where 

ei ^ - — = £>!£- 



4Vl + to2(2 + 2m + e(l - p)) 4\/l + to^ 

and we have set 

X ^ X 

Si := < 6. 

1 + TO + ie(l -p) 

The choice of Si is designed to ensure that if for some (xi{n),yi{n)), we have \xi{n) — Xn\ < Si, then 

{xi{n),yi{n)) e (x„ -S,Xn + S) x {y,, - S,yn + S). 

This follows because 



\xi{n) - Xn\'V\yi{n) ~ yn\ < ^1 + mSi + 2ei\/l + m? . 
See Figure [2I from the definition of ei we have this equal to 

(5.5) =Si+ mSi + 2 ^'^ f-^ ~ ^ V l + to2 = Sijl + m + ^^'^ ~ ) ^ S. 

4Vl + to2 2 

Let N = Ni V N2 and restrict attention to 71 > iV. Since F„ C , we have for all 1 < i < A:„ that 
{xi{n),yi{n)) £ F'^^. By convexity of F'^^, (a;,y) € F'^^. Therefore, referring to Figure [1] we have 

\zi{n) - Zn\ <|j/i(n) - mxi{n)\ + |?;„ - TOi„| 



(5.6) <ei VT + ^+ ei VT + ^ = 2eiVT+TO2. 

Using the representation (|5.3p we get, 

fc„ fc„ 

(2/i(") - Vn){xi{n) - Xn) (ziin) - z„)(a;j(n) - x„) 

^^■'^'> nin^— ^m+— , 

X;(xi(n) - i„)2 Z;(a;i(n) - x„)2 

i=l 1=1 
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Figure 1. The geometry of the neighborhood of the hnc F. 



Therefore, 



i=l 



J2 {xi{n) - a;„)2 



J2 \zt{n) - Zn\\Xt{n) - Xn\ 



< 



J2 \xiin) - Xn 



- x^^ 



J2 {xt{n) - Xn 

i=l 



where the last inequahty follows from ([5] 

For convenience, define the following notation: 



\S{x)\ 



S\x) 



<p 



B{{x,y),S) 
Therefore 



\Xi{n)-Xn\, 

\xi (n) —Xn I <p 

ixi{n) - Xn)"^ , 

\Xi {n) — Xn I <p 

(x — (5, X + (5) X {y — S,y -\- 5). 



\Six)\>, 
S^{x)>p 



\Xi {7l)~Xn I >p 



Xn I ; 
Xn ) 1 



J2 \xi{n) - Xn 

i=l 

{xi{n) - Xn 

i=l 



\S{x)Us, + \Six)\>s^ {\S{x)\^s, + \S{x)\>sJ/SHx)>s, 



{S^x)<sJS^x)>s,+l) 



^ i\Six)Us, + \S{x)\>s,) ^ 1 f\Six/S^)U, 



< 



SHx)>s, - Si \S^x/di)>i 

1 f #{{xi{n),yi{n)) e F„ : \xi{n) - x„| < 5i} 



< 



^1 V#{(2;»(")'y»(")) € Fn ■■ \Xi{n) - Xn\ > 61} 

1 f #{{xi{n),yi{n)) € F„ : {x^{n),y^{n)) G B((x„, y„), (5)} 



(5i V#{(a;j(n),yj(n)) € F„ : (a;i(n), ^ B((x„, ?/„), (5)} 
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Figure 2. The geometry of a neighborhood of the hne; support for ([5 



The choice of 6i justifies the previous step by (|5.5p . The previous expression is bounded by 

2 



and we recall p < 1. 
Consequently 



- S^^l-Pn^'' - S^^l-p^'' 6,il-py 



J2 \xiin) - Xr. 



\mn-m\ =2eiVl + m2-^ 



< 2eiVl + m2 



2=1 



Slil-P) 



This completes the proof that m„ — > m under condition (|5.ip . 



□ 



Corollary 5.2. If Xn ^ jJ-x < oo and yn ~> p,y < oo, as n ~> oo, then Provosition \5.1\ holds if we replace 
ix„,yn) in (EJP by {p^,py). 

Proof. In place of condition (|5.ip we are assuming 
(5.8) 

Let us fix (5 > such that 



3 ^ > such that p^^ #{(M.-^,M. + ^)><(M,->5,/., + <5)}nFn _ ^ 



. ._ - 2(5, fix + 2^) X (m, - 2S, t,y + 2^)} n Fn 

Pn ■ JMIP '■ptl.'Jjlj. 

Since Xn fJ-x < oo and yn ^ fiy < oo, there exists N* such that n > N* implies that {xn,yn) G 
{Px — S, p,x + S) X {py — (5, + S). Hence for n > N* 

#{(i„ - ^, x„ + (5) X (y„ - S,y„ + S)} f] 



< 



#{{px - 26, px + 25) X (My - 26, fly + 25)} f] F„ 



= p:-.pe [0,1) 



14 



B. DAS AND S.I. RESNICK 



Now choose Ni > N* such that for aU n > iVi , we have p„ < ^-y^ . This also means that 1 — p„ > . 
The rest of the proof is the same as that of Proposition 15. II □ 



6. Slope of the LS line as a tail index estimator 

For heavy tailed distributions, the slope of the least squares lin e through the QQ plo t made by the upper 
kn lar gest order statistics is a consistent estimator of 1/a. See Beirlant et al. ( 19961 ). Kratz and Resnick 
( 19961 ) and Resnick ( 20061 . Section 4.6). We connect the ideas of the previous section with this result. 

Proposition 6.1. Consider non-negative random variables Xi,X2, ■ ■ ■ , Xn which are iid with common dis- 
tribution F where F e RV^a and -'i^(i) > ^(2) ^ • • • ^ -^(n) '^'"^ order statistics in decreasing order. The 

1—1—1 P 

sets Sn and %i were defined in Proposition \4 ■ 1\ where we proved djr{Sn,Tn) — » 0, assuming k = k{n) — > oo 

and k/n —f Q as n oo. For convenience we defined 5„ — Sn + a„ and T = 7^^ + a„ where a„ was a random 
point. Write 

= {(-log77-,log-^;j = 1, . . . ,/c„} = {{xj{n),yj{n));j = 1, . . . ,fc„} ( say) and T ^ {ix,-);x> 0}. 



X, 



(fe) 



a 



Then, 
(6.1) 

as k 



LS{S'n) = LS{Sn) ^ - - LS{Tn) = LS{T), 
a 



oo and kn/n — s- as n —> oo. 



The result is believable based on the fact that djr(Sn:Tn) — > 0. However, since neither Tn nor T are IC2 
sets, some sort of truncation to compact regions of is necessary in order to capitalize on Proposition l5.ll 
For some integer M > 2, define 

Km = [0,M] X [0,—], 



and let 



M 



a 



5„ = 5„ n Km and = TnK 



M- 



Proof. Some preliminary observations. Clearly, LS{Sn) ~ LS{S^ + a„) = LS{Sj^) and with Xj{n),yj{n) 
defined in the statement of the Proposition, 



LS{Sn) 



SxY — SxSy 
Sxx - (Sx)"^ ' 



where, as usual, 

Sx = 

SxY = 



Sy 



E 



(xj{n),yj{n))eS'„ 



- y 



{xj {n),Xj (n))e5„ 



We need similar quantities 5^^, Sy , Sxy corresponding to averages of points restricted to Km, so for instance 



■'X ^'-'Y ^ '-'XY 

and fc*^ = #'5ri • ^ simple calculation given in Resnick (|2006l . page 109) yields as k ^ 00, 



(6.2) 



S 



X 



^(-log 



1=1 



loga;)da; = l, ^^^^ = ^(- log |)2 



i=l 



(— loga::)^c?x 
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while for Sy we have 
(6.3) 



k 



X 



i=l 



(0 ^ 1 
(fe) a 



since Sy is the Hill e stimator and is consistent for 1/a ( Csorgo et al. . 19851 MasonL 19821 Mason and Turova . 



1994lResnickl . l2006f ). 



We need the corresponding hmits for Sx , S^x^ ■ These calculations and subsequent calculations are 
simplified by the following facts: 

(1) The ratios of order statistics process converges, as fc ^ oo, k/n ^ 0, 

X( 



(6.4) 



in A(0,oo] (jResnickl . [iooi page 82). 
(2) Define the random measure 



1 " 



on (0, oo], which puts mass l/k at the points 1 < i < n}. Then 



(6.5) 



. p 



in the space of Radon measures on (0, oo], where Vaix, oo] = a; a; > (jResnickl . l2006l page 83). 
(3) The number of points k^^ in 5„ satisfies, as n — > oo, k —> oo, k/n —> 0, 



(6.6) 



k^'/k 4 1 - e-''. 



To see this, observe 



fc*Vfc =i#{j <k:k>j> ke-^' and ^ < e^*^/"} 
k 



(k) 



1- (e-) 



We continue using these three facts. For Sx we have 



5, 



M 



k^' 



E 



k^' 



E 



fog- 



0<log X^J■,/X^k)<2M/a 



Set 



j:k>j>ke~" 



1 



j:k>j>ke-" 



1 - e-A^ 
=:l + ex(M) 



log a; dx 



1 - e-A'^ 
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where ex{M) ^ as M — > oo. Also, Sx and {Sx^ are close asymptotically since 

P[S'^^[s^y]=p{ U [log|^>2Af/a]} 



since 



-.p[\og > 2M/a] 0, 

A(fe) 



We conclude 

(6.7) S^^ ^l + ex{M)■.^^,f, 
with ex{M) — > as M ^ cx), and in a similar way we can derive that 

(6.8) S^^x^'^ + ^xx{M), 



where exx{M) — > as Af ^ oo. For Sy we have 



j:k>j>ke-" ^ ' 

a<\ogX^f)/x^k)<2a-^M 



1 1 ^0) 



i:0<logX(j)/X(fc)<2a-ij\/AlogX(^j^^_M^)/X(j) 



logy i>„(d?/) 



2^ /.2a"-'AfAa"U'/ 



logyi'a(dy) 

M/a 



1 - e-A^ 

where ^ ^ as Af ^ oo. We conclude 



(6.9) S^' 4 ^f^. 
To prove (|6.ip . we follow the following outline of steps. 

• Step 1: Prove 5^*"^ ^ T^' . 

• Step 2: Verify that Corollary 15.21 is applicable by showing that the analogue of (|5.ip holds. This 
permits the conclusion that 

Coupled with ^J^, and (US]), this yields 

(6.10) = - + exy(M) + Op(l), 

a 

where limjv/^oo £xy(-^^) = and Op(l) — > as n ^ oo. 

• Step 3: Compare Sxy and S'^V and Check that 

(6.11) lim limsupP[|S*fy - Sxy\ > ?7] = 0, V?7 > 0. 

_ p _____ 

This gives S'jcy -^2/ a which coupled with (|6.2p and (|6.3p imphes 
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We may check Step 1 using a very minor modification of Lemma l2.3[ following the pattern of proof used 
for Proposition 14. II For Step 2, the challenge is to verify condition (|5.ip holds and we defer this to the end 
of the proof. Thus we turn to Step 3. 

First of all, we observe that Sxy ^'^'^ Sxy average, respectively fc*^ and k terms but there is no need to 
differentiate: For any rj > 0, 



{xj{n),yj{ri))GS'„ 



1 1 I 



ixjin),yj{n))GS'^" 



and dividing the sum by k yields 



=P 



S 



M 
XY 



1 - 



> V 



Since Sjly convergent in probability, it is stochastically bounded and since, as n — s- 00, 



k 



we conclude 



(6.12) lim limsupP 



Next observe for 77 > 0, 
1 
k 



l-(l-e-*0 = e 



Xi{n)yi{n) - - 
k 



-M 



M- 



0, 



^ Xi{n)y^{n) 



> V 



= 0. 



E 



1 ^ 

,{n)y,{n)—- ^ a^jHy^H > v] < P{ \J > 



,2M/a 



(6.13) 



k>j>ke-" 



k>j>ke-^'' 



(k) 



]} 



<P[^^iI^^ > e^*^/"] - 0, (n^oo). 



X, 



(fe) 



Note that by the Cauchy-Schwartz inequality, 



k>j>ke-'-' 

Furthermore 



l<j<ke-" 



l<j<ke-" 



'g-^(rfce-Ml)/^(fc) 



l<j<ke-^' 



{\ogyfvy,{dy) 



l<j<ke-^' 



and using ()6.4|) . we have for some c > 0, all large n and some M that the above is bounded by 



(6.14) 



M 



(logy)2z>„(dy) +Op(l). 



Assessing (I6.12p . (|6.13p and (|6.14p . we see that (|6.1ip will be proved if we show 



(6.15) 



lim lim sup P 



{\ogyfyn{dy)>ii -0, (V?7 > 0). 



M 



This t reatment is similar to the stochastic version of Karamata's theorem ( Feigin and Resnic^ ( 1997 ) . Resnick 
pood page 207). For < < 1 A a and large M, the integrand (logy)^ is dominated by y^. Bound the 
integral by 

/>oo 

i>„(y, 00] Cy^-^dy + Af«i>„(M, c3o]. 



M 
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If we let first n — > oo and tlien A/ — s- cx), for the second piece we have 

p 



M'^i>n{M,oo] ^ M^Vc,{M, oo] = Af^"" ^ 0. 



Now we deal with the integral. Set h{t) = (1/(1 - F))^{t) so that X(k)lb{n/k) 1 (jResnickl . l2006l page 
81). For 7 > 0, 

POO nOG 

P[ / i>„(y, ^] Cy^-^dy > 77] =P[ / i>„(y, ^] Cv^-^dy > 77, 1 - 7 < X^k)/b{n/k) < 1 + 7] + o(l) 



M 



r H ex./6(n/fc) ((1 - 7)y, oo]C2/'^-M2/ >v] + o{i). 

1=1 

Ignore the term o(l). Markov's inequality gives a bound 

<{const) j^^ i?(-^P[X, >fe(n/fc)(l-7)y]jC/-idy 



i=l 



= (consi) / -F(b{nlk){l~^)y])Cy^-Hy. 
J hi ^ 



and a pplying Karamata's theorem (iBingham et al.l . ll987llde Haanl . I1970L I iGeluk and de Haai]lll987llResnickl . 
20061 ). we have as n ^ 00 that this converges to 



={const) / ((1 - 7)2/)-"Cy^~^rfy * 0, 

J hi 

as required. This finishes Step 3 and completes the proof modulo the verification that (|5.8p can be proven 
for this problem. 

The remaining task of checking (|5.8[) proceeds as follows. Recall fi^ and fiy from (|6.7p and (|6.9p . Fix 
Af . Then for p'^ in ((5?8| . we have 



1 



k 



J 



X. 



{'}) , ,,M 



(J) 



2A//. 



Tm#0 : A^x -5 < -log^ < ^l'^+5,Q<-\og'- < Af;/^^ < log < My + -5, < log < }. 



Since jix ~ 1 and ^iy ~ '^'^ ^o^' large 
Ps ■- 



1 






# 


1 






# 


k 









^(fe) 



X. 



J- 



X. 



(rfecxp{-(p«-5)}l) 



X 



V e^^ < 



(fe) 



< 



^(fe) 



{k) 



V e 



(fc) 



A e'' 



(fe) 



Apply (|6.4p and (|6.5p and we find 

Since ji'^ k, 1 and /iy^ « 1/a, by picking Ai^ large and 5 small, the right side can be made to be less than 1. 



This completes the proof. 



□ 
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